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ABSTRACT 

The Milky Way's most luminous, young and massive (M > 10^ M©) star clusters and OB associations 
have largely evaded detection despite knowledge of their surrounding H II regions. We search for these 
clusters and associations within the 40 star forming complexes from Rahman & Murray in the 13 most 
luminous WMAP free-free emission sources of the Galaxy. Selecting for objects with the dust-reddened 
colors of OB stars, we identify new candidate associations using the 2MASS point source catalog. In 
40 star forming complexes searched, 22 contain cluster/association candidates with sizes and masses 
in the range of 3'-26' and 10^-^-10^ M©. Of the 22 candidates, at least 7 have estimated masses > 10^ 
M0, doubling the number of such massive clusters known in the Galaxy. Applying our method to a 
statistically similar set of test locations, we estimate that 3.0 ± 0.6 of our 22 candidate associations 
are unrelated to the star forming complexes. In addition, the apparent extinctions of our candidate 
associations correlate well with the predictions from a Galactic model. These facts, along with the 
clear detection of a known OB association and the previous spectral verification of one cluster found 
by this method, validate our method. Only one of the searched WMAP sources remains without a 
candidate. In 8 of the most luminous WMAP sources, the candidate associations can account for 
the observed free-free flux. With our new compilation, the Galactic census of young, massive stellar 
associations may now be about half complete. 

Subject headings: open clusters and associations — stars: massive — stars: formation — infrared: 
stars 



1. INTRODUCTION 

A spiral galaxy's metabolism is dominated by the ef- 
fects of its most massive (M > 10^ M©) young stellar 
clusters and OB associations. Not only are these clusters 
and OB associations the birthplaces of a substantial frac- 
tion of a galaxy's stars, but they also dominate the en- 
ergetic feedback within the galactic disk and drive disk- 
halo interactions through the production of superbubbles 
(McKee & Wilhams 1997; Chu 2008; Weaver et al. 1977; 
Norman & Ikeuchi 1989; Reynolds et al. 2001). While 
the upper end of the cluster mass function (CMF) has 
been derived through Galactic H II region observations 
(McKee & Williams 1997; Murray & Rahman 2010), the 
powering clusters of the H II regions have mostly evaded 
detection. This is primarily due to the large distance 
and extinguishing column through the Galactic disk. Al- 
though a number of members of the upper end of the 
CMF have been identified and characterized, including 
NGC 3603 (Stolte et al. 2004) and the Galactic Center 
clusters (Figer et al. 1999), the census of the upper end is 
far from complete. This poses a problem for any global 
studies of the Galaxy's energy budget, star formation 
properties, turbulence, and overall ecology. Additionally, 
more complete knowledge of the upper end of the CMF is 
required for detailed comparison to other galaxies, criti- 
cal to further understanding the role of star formation in 
galaxy evolution (Whitmore et al. 2007; Larsen 2009). 
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Recent work by Rahman & Murray (2010, here- 
after RMIO) with data from the Wilkinson Microwave 
Anisotropy Probe (WMAP) and the Spitzer GLIMPSE 
survey has identified 40 star forming complexes (SFCs) 
within the Galaxy's most luminous free-free emission 
sources. Only one of the SFCs has a previously iden- 
tified powering cluster (NGC 3603), indicating that the 
complexes may be harboring unknown luminous young 
massive clusters and OB associations. Notably, Rah- 
man et al. (2011a) identified a candidate association for 
the most luminous SFC (the Dragonfish Nebula) based 
on the density of near- infrared (NIR) point sources, fil- 
tering non-member field stars using an extinction-based 
color-cut. The candidate association is located 9.7 kpc 
through the Galactic disk with a line-of-sight extinction 
1.0 < Ak < 1.4 mag. Without the NIR color selection 
process, the association would remain hidden to visual 
or automated cluster search methods. Rahman et al. 
(2011b) confirmed this association with NIR spectra of 
a sample of its brightest members. 

In this paper, we expand upon the method of Rahman 
et al. (2011a) to search for young massive clusters or OB 
associations in all of the SFCs identified in RMIO. In §2, 
we discuss previous cluster searches. In §3, we outline 
the nature of the target SFCs. In §4, we discuss the use 
of the Two Micron All-Sky Survey (2MASS) for cluster 
searches and present the cluster search technique. In §5, 
we present our new candidate clusters and associations, 
discuss their statistical significance, and test the method 
through which they are identified. In §6, we estimate 
masses and luminosities for the candidates. In §7, we 
produce and discuss the most recent census of Galactic 
young massive clusters and OB associations. We sum- 
marize our conclusions in §8. 
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2. PREVIOUS CLUSTER SEARCHES 

Star clusters have historically been discovered in the 
visible wavelengths. However, these discoveries have 
been limited to the closest cluster populations due to the 
substantial dust obscuration through the disk. The ad- 
vent of large NIR surveys has enabled the search for star 
clusters and OB associations despite this obscuration. 
Specifically, the 2MASS (1.2-2.2 /im) and the Spitzer 
Galactic Legacy Infrared Mid-Plane Survey Extraordi- 
naire (GLIMPSE, 3.6-8.0 /im) have produced catalogs 
covering large areas of the Galactic plane in wavelengths 
where stellar flux dominates but extinction effects are 
minimized, although not completely eliminated (Skrut- 
skie et al. 2006; Benjamin et al. 2003). 

Dutra et al. (2003) and Bica et al. (2003) have sys- 
tematically searched for star clusters in 2MASS, focused 
around optical and radio nebulae-primarily H II regions. 
Their search used visual identiflcation of clustering on 
small images (5^- 15' square) centered around the neb- 
ulae. Though this process, 346 new cluster candidates 
were identified. However, these efforts only searched for 
clusters around known nebulae, missing those not coin- 
cident with nebular emission (see §3). In a more au- 
tomated manner, Froebrich et al. (2007) used the to- 
tal on-sky stellar density of 2MASS point sources within 
\b\ < 20° to identify 1788 cluster candidates, of which 
1021 are new. This method is less biased towards nebula- 
coincident clusters, but more susceptible to false pos- 
itives. They estimated that their contamination rate 
of chance overdensities within their catalog is approxi- 
mately 50%. In both these cases, the searches are most 
sensitive to dense clusters, rather than diffuse regions. 
For instance, Froebrich et al. (2007) only detected a small 
clustered region within Cygnus 0B2, rather than the en- 
tire OB association with a half-light radius of 13^ which 
is readily identifiable with 2MASS (Knodlseder 2000). 
Over 96% of the Froebrich et al. (2007) catalog candi- 
dates have a total central cluster density of greater than 
four sources arcminute"^, and over half greater than 10 
sources arcminute"^. For the highly veiled case where 
only the most luminous stars are visible, typical source 
densities of 10^ Mq OB associations are expected to be 
- 1.0(i?/10 pc)-'^{D/10 kpc)°-^^ sources arcminute-^, 
where R and D are the association radius and distance 
(Rahman et al. 2011a). For comparison, the typical stel- 
lar density of a 2MASS field in the Galactic Plane is 
between 14 and 17 sources arcminute"^ (§4-1)- Conse- 
quently, these previous 2MASS searches are insensitive to 
more dispersed OB associations. Similarly, Mercer et al. 
(2005) identified 92 new cluster candidates using similar 
automated and visual identification methods with the 
GLIMPSE point source catalog. Ah of the GLIMPSE 
cluster candidates have diameters of 3' or less, indicat- 
ing the same bias towards dense clusters. Since there are 
likely many more associations with diameters > 3', such 
as Cygnus OB 2, a systematic search optimized towards 
larger, more massive OB associations is required to find 
them. 

For the candidate clusters identified in the works of 
Dutra et al. (2003), Bica et al. (2003), and Mercer et al. 
(2005), further photometric and spectroscopic studies 
have found high rates of false positive clusters (Borissova 
et al. 2005; Ivanov et al. 2005). Those clusters that are 



confirmed tend to have masses in the range of 10-1,000 
Mq and contain no stars with spectral types earlier than 
B, so cannot be significant ionization sources (Hanson & 
Bubnick 2008; Soares et al. 2008). This suggests that 
most of the galaxy's largest ionizing clusters have not 
been identified. 

The recent work by Bica & Bonatto (2011) indicates 
that when candidate clusters are poorly populated or 
projected against dense stellar backgrounds, discerning 
a real cluster from a field density fiuctuation (or aster- 
ism) is difficult or impossible. The effect poses a prob- 
lem in the search for the most luminous clusters in the 
Galaxy located at significant distances; only the bright- 
est members would be visible due to the substantial ex- 
tinction and distances through the Galactic disk, even 
in the infrared regime. This creates the appearance of 
a poorly populated cluster. Further, most, if not all, 
of these clusters and associations should be located in- 
side the Galactic disk, leading to a substantial confusing 
stellar background population. Therefore, these previ- 
ous searches are best suited to finding nearby, compact 
clusters far from the Galactic mid-plane. 

In summary, previous automated cluster searches are 
sensitive to compact clusters, like NGC 3603, but not 
to more diffuse associations like Cygnus 0B2 and the 
Dragonfish. To thoroughly catalog the Galaxy's most 
luminous clusters using NIR surveys, one must exclude 
a large fraction of the field giants. 

3. WMAP-IDENTIFIED STAR FORMING COMPLEXES 

To identify the locations of the most luminous OB asso- 
ciations, we use the identification of their parental SFCs 
from the WMAP and Spitzer GLIMPSE survey (Murray 
& Rahman 2010, RMIO). The WMAP mission provides 
all-sky, high sensitivity (~ /iK) microwave maps with five 
bands in the 23.5 — 90 GHz range with a relatively large 
(^ 1°) beam size. The five bands permit the separation 
of each of the Galactic "foreground" emission compo- 
nents; synchrotron emission, thermal dust emission, and 
free-free emission. The free-free emission almost entirely 
arises from reprocessed ionizing photons produced by the 
young, luminous (and massive) stars that are found en 
masse in the most luminous young clusters and OB as- 
sociations. Murray & Rahman (2010) identify free- free 
emission sources through the WMAP foreground map, 
finding that 14 sources are responsible for one-third of 
the total galactic ionizing luminosity. These regions are 
expected to be powered by young, very massive clusters 
or OB associations that contain sufficient 0-type stars 
to produce the observed free-free emission. Within these 
14 sources, only two have had previously identified pow- 
ering clusters and associations, specifically the Galactic 
Center region and NGC 3603 (Murray & Rahman 2010). 
Also notable is that many of the previously known mas- 
sive OB associations, such as Carina and Cygnus 0B2, 
are not enclosed by these 13 most luminous regions. This 
suggests the existence of ionizing clusters or associations 
more luminous than those previously identified. 

The large (> 1°) radii of the WMAP sources leads 
to significant confusion along the line of sight, espe- 
cially in the sources towards the inner Galaxy. RMIO 
use the Spitzer GLIMPSE 8 jam emission maps and ra- 
dio recombination line radial velocities to separate the 
large WMAP sources into SFCs for the 13 most lumi- 
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nous sources from Murray & Rahman (2010), excluding 
the Galactic Center region. The observed SFCs differ 
in structure from the traditional model of a Stromgren 
sphere H II region: the complexes often appear as limb- 
brightened shells with the strongest nebular emission 
tracing large shells, typically 5' to 30' in radius, rather 
than in the central region. This gives the impression 
that a powerful central association has evacuated a zone 
of neutral gas, which it illuminates and ionizes. The 
brightest knots of emission in these shells, which often 
coincide with previously identified H II regions, could 
represent sites of triggered massive star formation. Fur- 
ther, Murray & Rahman (2010) find that the majority of 
ionizing photons escape from the shells and produce dif- 
fuse ionization surrounding the entire region, often with 
radii > 1°, an effect previously inferred in the literature 
(Anantharamaiah 1985; McKee & WiUiams 1997). While 
the classical Stromgren sphere interpretation of the SFC 
structure would suggest that the powering source should 
be located towards the center of the bright emission, 
this scenario suggests that the cluster is not expected 
to be coincident with the nebular emission for any but 
the youngest and smallest complexes. 

Within the 13 investigated WMAP sources, RMIO 
identify 40 discrete SFCs with distances ranging from 
2.5 to 15 kpc and semimajor axes between 2' and 67^ 
In some cases, the kinematic distance ambiguity is not 
resolved and two possible distances are identified. Since 
the WMAP sources are distant and projected against the 
crowded Galactic plane, we expect a great deal of confu- 
sion along the line of sight: while the inferred luminosi- 
ties of these sources suggest that they contain massive 
clusters, they also likely contain a number of less-massive 
sources along the line of sight. Due to the large fraction 
of ionizing photons escaping the central area of the com- 
plex to radii similar to the distances between complexes, 
we cannot disentangle the different flux components from 
each of the sources. This makes it difficult a priori to de- 
termine which of the complexes are home to the massive 
clusters using ionization-based measures. In this work, 
we search for massive clusters and associations in all of 
the identified SFCs. 

Previously, we have directed our attention to the most 
luminous of the SFCs, the Dragonfish Nebula. We de- 
veloped a method using the 2MASS point source cata- 
log to identify a candidate for the Galaxy's most lumi- 
nous OB association, with a total stellar mass of 10^ Mq 
(Rahman et al. 2011a). Follow-up NIR spectroscopy of 
bright stars within the candidate association confirmed 
the existence of Dragonfish Association with membership 
statistics consistent with the candidate's expected value 
(Rahman et al. 2011b). This example lends credence to 
the NIR color selection method as a means to identify 
candidate clusters. In this work, we use the method of 
Rahman et al. (2011a) to identify candidate clusters and 
OB association within the remaining SFCs from RMIO. 

4. THE SELECTIVE STAR COUNT METHOD 

We refine the method of Rahman et al. (2011a) to iden- 
tify candidate clusters and associations within the SFCs 
using the line-of-sight extinction through the Galactic 
plane. This filters non-cluster stars from candidate mem- 
bers, allowing the associations to be identified in areas 
of significant point source contamination. Schematically, 



the process is as follows: We isolate stars matching the 
NIR colors of 0-type stars at graduated extinction levels. 
We then use the on-sky density of point sources matching 
the color cut to find overdensities consistent with the lo- 
cation of the SFC. Verifying from binned color-color and 
color-magnitude diagrams that the candidate cluster is 
not caused by a chance overdensity of projected Galac- 
tic structure based on the color distribution along the 
line of sight, we extract the candidate cluster's member- 
ship properties. We refer to this method as the Selective 
Star Count (SSC) method, which we describe in greater 
detail. 

4.1. The 2MASS Point Source Catalog 

To identify members of the candidate cluster, we use 
the 2MASS Point Source Catalog (Skrutskie et al. 2006). 
The survey covers the entire sky with J (1.25 /xm), H 
(1.65 /im), and Ks (2.16 /im) bands, enabling con- 
sistent coverage of the SFCs in both the northern and 
southern skies. We limit our searches to the point sources 
with photometric quality flags of A, B, or C, correspond- 
ing to a minimum signal-to-noise ratio of 5 for all of the 
J, and Ks bands. The 2MASS catalogue is complete 
to J < 15.8, H < 15.1, and Ks < 14.3 magnitudes in 
the absence of confusion. However, the areas we investi- 
gate are highly confused causing the completeness limit 
to be up to 1 magnitude brighter. This prevents the de- 
tection of less luminous objects that are expected to be 
cluster members based on the stellar initial mass function 
(IMF). 

The typical FWHM of the 2MASS point sources is 2'.'5 
and quoted magnitudes are measured with apertures of 
4'^, which will cause significant confusion for the most 
compact clusters or the dense areas of more diffuse OB 
associations (Skrutskie et al. 2006). Given the selected 
quality limit, these densely-packed confused sources will 
be excluded from our searches. Based on this constraint, 
we expect our search method to be insensitive to clus- 
ters with on-sky point source densities ^31 arcminute"^ 
due to overlapping apertures causing confusion. Within 
1° of the Galactic midplane, we find a total density of 
point sources in the target areas to be between 14 and 17 
arcminute"^ meeting the chosen quality cut. Therefore 
we expect such compact clusters to be readily identified 
by visual inspection of the images. In this sense, our SSC 
method is entirely complementary to search strategies of 
Bica et al. (2003), Dutra et al. (2003) and Mercer et al. 
(2005). We discuss this further in the case of NGC 3603 
in §5.1.1. 

4.2. Extinction- derived Color Selection 

The hypothesized powering clusters and associations 
of the SFCs should exist at large extinctions due to their 
distance through the Galactic disk. We make use of this 
to identify the cluster candidates on the basis that all 
cluster members should be similarly extinguished along 
the line of sight. Our method for uncovering candidate 
clusters is to identify overdensities of stars with colors 
consistent with massive stars at extinction ranges con- 
sistent with the distances to the SFC being probed. 

All of the SFCs were identified using techniques sen- 
sitive to ionizing flux with large ionizing luminosities 
{Qo ^ 10^^-^ s~-^). The most luminous stars near the 
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stellar upper mass limit have significantly smaller ioniz- 
ing outputs (Qo = 10^^-^ s~^ for an 03V star; Martins 
et al. 2005). The measured ionizing luminosities there- 
fore require very large numbers of 0-type stars to power 
them. For this reason we specifically search for overden- 
sities of 0-type stars with magnitudes and colors consis- 
tent with the distances and expected extinctions to the 
SFCs. 

We use the NIR 0-type star magnitudes and colors 
from Martins & Plez (2006). We note that the colors 
of all 0-type stars in J — H and H — Ks are constant, 
with J — H = —0.11 and H — Ks = —0.10, as expected, 
since these bands are in the Rayleigh- Jeans tail at the 
surface temperatures of 0-type stars. These colors are, 
however, not unique to 0-type stars; B- and early A- 
type stars are indistinguishable from 0-type stars based 
on NIR colors alone to the accuracy of the 2MASS color 
measurements (Rahman et al. 2011a), and later stars are 
relatively more abundant; there is a roughly 8:1 ratio be- 
tween O and B-type stars (Kroupa 2001). Only through 
their magnitudes can we make a distinction between O- 
, B- and A-type stars, given a particular distance and 
extinction. 

We take the intrinsic colors of 0-type stars and red- 
den them to arrive at our 2MASS color cuts. For 
the extinction law, we adopt that of Nishiyama et al. 
(2009), which gives the extinction ratios ^ = 2.89 and 

^ = 1.62. We note that this is a departure from the 
canonical values for the extinction ratios due to a dif- 
ferent power law slope for the relationship between ex- 
tinction and wavelength {Ax ex A"); this determination 
uses a slope a = —2.0, while the previously accepted 
value is a = —1.61 (Cardelli et al. 1989). We adopt the 
Nishiyama relationship over the canonical as it is empiri- 
cally determined from a more highly extinguished sample 
of stars that are more likely to be consistent with the ex- 
tinctions that we probe in this work. The shallower slope 
has been found to be erroneous by Stead & Hoare (2009), 
who use an even steeper power law slope of a = —2.14. 

The steeper slope of the extinction law changes the 
relationship between the NIR (> 1 jam) and visual (~ 
0.5 /im); rather than a ratio of Ay/Ax ~ 10, the new 
ratio becomes Ay/Ax ~ 16 (Nishiyama et al. 2008). 
However, the difference in the extinction ratios between 
the different laws in the NIR (1 — 2.5 /im) bands is less 
than 20%, which is inconsequential in the candidate clus- 
ter identification since we use only the NIR bands and 
the large color cuts used for the selection procedure. 

We use the following relations for A^, the estimated 
0-star extinction in the K-band: 

^k{J -H) = 0.797( J - i7) + 0.0877 (1) 
A'j^\h - Ks) = IM{H - Ks) + 0.166 (2) 

We use these relations to select point sources consistent 
with 0-type stars in a particular extinction range. 

4.3. The Nearest Neighbour Algorithm 

We analyze the density structure of 2MASS point 
sources using a nearest neighbour method, as described 
by Casertano & Hut (1985). This method samples each 
of the points within the chosen color cut, providing an 
adaptive resolution, sampling the areas with more point 
sources more heavily than those areas without point 



sources. The primary advantage of this method over 
a simple spatial density map is that the resolution is 
adaptive; areas with large numbers of point sources will 
contain detail of density structures at a higher resolution 
than areas with a sparsity of sources. From each of the 
points, we measure the distance vj to the j-th nearest 
neighbour. We determine the local surface density, /ij, 
with an unbiased estimator: 



i-1 



(3) 



with j > 3. From Casertano & Hut (1985), the standard 
deviation of the estimator is: 



(4) 



To ensure the consistent visual appearance of the point 
source density variations regardless of the number of 
sources in the field, the specific value of j to be used is a 
function of the total number of point sources in the field. 
If a constant j value was used for all fields, those with 
large numbers of point sources would sample only very 
local variations of the surface density, while those with 
few point sources would smooth over all but the largest 
density structures in the field. The on-sky point source 
density varies based on the location in the Galactic plane, 
the color-cut chosen, and the width of the color-cut. The 
typical on-sky densities of the point sources can vary by 
an order of magnitude based on different color cuts since 
fewer stars with very red colors are cataloged in 2MASS 
than bluer stars. Consequently, using a constant j value 
over all color cuts would dramatically change the scale 
of structure highlighted. So for each cut, we determine 
an optimal value for j given a angular resolution based 
on the average number of point sources within a field: 



I ^cr^* I 
L Q2 J 



(5) 



where A^* and O are the total number of stars in and 
angular radius of the color-selected 2MASS field, and 
Ocr is the characteristic resolution, corresponding to the 
mean resolution across the selected field. For the purpose 
of this investigation, we choose a characteristic resolu- 
tion Ocr = 3.5' based on the resolution used to identify 
the Dragonfish Association from Rahman et al. (2011a). 
At this characteristic resolution, typical j values range 
from 15 to 30 for most regions with extinction cuts of 
A Ax = 0.1. The densities are gridded onto an oversam- 
pled {d < 11^'), uniformly-spaced grid using a Delaunay 
triangulation method from the Matplotlib python pack- 
age (Hunter 2007) to simplify the analysis. 

4.4. Candidate Identification 

For each of the SFCs, we examine the surrounding 2° 
field with 0.1 steps in the A^ color cut in the range of 
0.0 <A^x < 2.0. We enlarge the surrounding field to 3° 
for SFCs with semi-major axes larger than 30'. In each 
extinction step, we produce the nearest neighbour on- 
sky density map, and visually identify cluster candidates 
coincident with the on-sky position of the SFC position 
and the 8 jam morphology from RMIO. 

Using small steps in the color cut allows for effective 
filtering of field stars from candidate clusters: while the 
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typical point source density of unfiltered sources ranges 
from 14 to 17 sources arcminute"^ (§4.1), the typical 
field densities within the color cuts range from 0.05-0.5 
sources arcminute"^. The substantial decrease in the 
field source density due to the color filtering enables the 
detection of more diffuse clustering than has been possi- 
ble in previous searches (§2). 

Our adoption of sequential extinction cuts over the en- 
tire range allows for the filtering of effects due to the 
structure of the Galaxy itself. For instance, a significant 
effect of Galactic structure is the appearance of patchy 
but mostly latitude-dependent stratification of the source 
density above and below the Galactic plane caused by 
the limited thickness of the extinction-producing gas disk 
and also by the appearance of all thick disk and halo stars 
at a particular color range (Figure 1). These large scale 
effects can be visually separated from localized overden- 
sities with ease. 

Once a candidate is identified, the extinction range is 
refined to enhance the prominence of the overdensity, and 
a final density map is produced. From this map we vi- 
sually identify the candidate's geometric boundaries, in- 
cluding its centroid position, semi-major and semi-minor 
axes. Once the boundaries are chosen, we determine the 
total number of enclosed sources, Ntot- To constrain 
field contamination, we measure the mean (Nbg) and 
standard deviation {(Jbg) of the number of field stars 
surrounding the candidate in regions with the same area 
as the identified candidate. 

To better distinguish between real clusters and as- 
terisms, we analyze each candidate cluster with binned 
color-color and color-magnitude diagrams as in Rahman 
et al. (2011a). Both sets of diagrams allow us to dis- 
tinguish between a true overdensity of sources (evidence 
for the existence of a cluster or association) or a shift 
of sources from one color to another (evidence of a local 
extinction feature). From these diagnostics, we find that 
none of the overdensities appear to be shifts of points in 
color-color or color-magnitude space, which is additional 
evidence against the alternative hypothesis of a local ex- 
tinction feature. 

5. CANDIDATE CLUSTERS/ASSOCIATIONS 

In Table 1, we present the candidate clus- 
ters/associations identified within the SFCs. For 
SFCs with multiple candidates, an alphabetical key is 
used to distinguish between them. To quantitatively 
measure the statistical significance of the extracted 
clusters as compared to the natural field variations, we 
define the extraction significance: 



ctbg 

with Ntot^ Nbg^ and a eg as defined in §4.4. 

Two of the candidate associations (those within SFC 
8 and 30) have extraction significance values of less than 
unity; these candidates are identified solely based on 
their morphology as compared to the SFC and have 
significant large scale structure in the field surrounding 
them that produces large background variations. 

The candidates have characteristic radii between 3' and 
20', with a median of ^ 7'. This is consistent with the 
sensitivity of the SSC method to larger diffuse structures 



than in previous cluster searches (see §4.4). Accordingly, 
these candidates would not have been identified in the 
previous searches. While the SSC method is more sensi- 
tive to diffuse structures, the ellipses are chosen to con- 
tain the most prominent areas of the overdensity and 
do not necessarily represent the outer boundaries of the 
clusters. We expect that K-giants are the primary con- 
taminants of all the new candidates, since they tend to 
be the dominant population of visible red stars in the 
2MASS survey (Robin et al. 2003). 

The candidate boundaries are visually chosen to en- 
compass the most centrally-condensed, smooth portions 
of the overdensity, avoiding the lower isodensity contours, 
which are more flocculent, when possible. This corre- 
sponds to an extraction contour of l-a to 2.8-(7 above 
the local background variation level. We investigate any 
bias in the visual identification of candidates, the chosen 
extraction contour, and their effects on their extracted 
parameters with a suite of tests described in Section 5.3. 

We present the stellar density diagrams and the 8 /im 
images from the Spitzer GLIMPSE or Midcourse Space 
Experiment (MSX) in Figures 2 to 8. These images in- 
dicate the SFC location from RMIO and the candidate 
cluster location. 

Many of the candidates are not centered on the nebu- 
losity visible in the 8 /im images. This may be an indi- 
cation of the age of the cluster; if the cluster is young, 
it would not have had sufficient time to evacuate its sur- 
rounding material. Consequently, it would appear at the 
same location as the 8 /xm emission. If the cluster is 
older, it would have evacuated its surrounding material 
and likely blown a bubble. A cluster in this stage would 
appear separate from the material it is illuminating. In 
addition, the SFCs identified by RMIO are visually iden- 
tified and may not correctly reflect the location or the 
size of the SFC based on the limited data. If confirmed, 
the candidate clusters will enable the refinement of the 
SFC parameters. 

Since the candidate extinctions were determined inde- 
pendently from the position and distance to their host 
SFC, we can use the modeled extinction from Marshall 
et al. (2006) as a consistency check. We present the com- 
parison of the candidate cluster extinction ranges with 
the modeled extinction in Figure 9. To determine the 
model extinctions, we use the SFC kinematic distance 
form RMIO in Table 1. We also indicate the possible 
model extinctions of SFCs without a kinematic distance 
ambiguity solution and the location of the Dragonfish 
association from Rahman et al. (2011a). We find the 
correlation coefficient of the two extinctions to be 0.75 
when using the more consistent distance in the cases of 
the distance ambiguity. The correlation is not simply the 
result of choosing the most consistent distance, however. 
We have randomly shuffled the inferred and modeled val- 
ues of Ak flve thousand times, choosing, for each shuf- 
fling, the most consistent model extinction. The results 
are shown in Figure 10: without shuffling, the ratio of 
modeled to observed extinction is l.lzb 0.22 dex (stan- 
dard deviation in the logarithm); with shuffling, the ra- 
tio becomes 0.9 ± 0.36 dex. The increased scatter in 
the shuffled sample strongly supports the validity of the 
candidate clusters, at least on a statistical basis. This 
correlation provides strong evidence for the reality of the 
candidate clusters. For sources with a distance ambigu- 
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Figure 1. 2MASS Point Source Density Maps at the location of SFC 31 (white circle) with extinction cuts between = 1-0 and 1.6 (left 
to right, top to bottom). The color bar indicates the point source density in sources arcminute"^. Galactic structure effects produce the 
long horizontal streak-like features that appear in the first extinction cut symmetrically 1° above and below the galactic plane and narrow 
in subsequent cuts. These features are visually filtered from the selection process. Despite the appearance of compact overdensities, these 
cuts do not contain any identified candidate clusters, since they are likely higher-density pockets associated with the Galactic structure 
features. This can be compared to the candidate identified in this region at a lower extinction cut (Table 1; Figure 6). 



ity, the modeled extinction provides support in favour of 
one of the two possibihties. We find this correlation to be 
strikingly better than the correlations between extinction 
and distance. 

5.1. Previously Identified Clusters 

An important test of the success and the sensitivity 
of the SSC method is identifying known massive clusters 
and associations within 2MASS. For this purpose, we use 
the cases of NGC 3603, the powering cluster of SFC 28, 
and Cygnus OB 2. We discuss these two cases below. 

5.1.1. NGC 3603 

A notable cluster that was not detected by the SSC 
method is the massive cluster within NGC 3603 that 
powers SFC 28. This very compact cluster is mostly 
contained within 33'^, requiring high spatial resolution 
to discern individual members (Stolte et al. 2004). At 
the resolution of 2MASS, the central cluster is highly 
confused, as can be seen in Figure 11. This prevents indi- 
vidual sources from being identified, so they are missing 
from the 2MASS PSC. In fact, only 17 point sources are 
identified in the central region that pass the desired qual- 
ity cut for the SSC method (inset of figure 11). Further 
filtering point sources to those with colors corresponding 
to 0-type stars at the extinction of NGC 3603, we are 



restricted to 12 point sources. Although these 12 sources 
do represent a local maximum in the color-selected source 
density within the cluster's boundary, this is neither sig- 
nificant when compared with the background variation, 
nor representative of the cluster's true stellar population. 
In redder color cuts, this region is seen as a void in point 
source density. This is consistent with our expectation 
that the SSC method is insensitive to compact clusters, 
such as those that can be easily identified visually. 

Despite this, the SSC method is useful to investigate 
the entire area of the NGC3603 natal cloud for a diffuse 
population surrounding the dense cluster. We suspect 
that these are the candidate clusters we identify in SFC 
28, on the edge of the region. They appear at a signif- 
icantly higher extinction than NGC 3603 (AA^ ^ 0.4). 
If they are not unrelated regions farther across the Galac- 
tic plane, they could be regions deeply embedded within 
the cloud, consistent with the CO morphology from Gra- 
belsky et al. (1988). In either case, these candidates are 
not the powering source of SFC 28. 

5.1.2. Cygnus 0B2 

To ensure that the SSC method can identify known OB 
associations, we use Cygnus 0B2 as a test case. This as- 
sociation, located in the outer Galaxy, has a substantially 
reduced background stellar population in comparison to 
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Figure 2. Spitzer GLIMPSE 8 fim image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The green sohd elhpse indicates the location of the SFC from RMIO, while the red dashed 
ellipses indicate the location of the candidate clusters. The color bar indicates the point source density in sources per square arcminute. 
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Figure 3. Spitzer GLIMPSE 8 /im image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. 
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Figure 4. Spitzer GLIMPSE 8 /im image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. 
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Figure 5. Spitzer GLIMPSE S/xm image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. We use the MSX 8 fim image 
for SFC 26 due to Spitzer GLIMPSE coverage limits. 
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Figure 6. Spitzer GLIMPSE 8 fim image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. We use the MSX 8 fim image 
for SFC 28 due to Spitzer GLIMPSE coverage limits. 
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Figure 7. Spitzer GLIMPSE 8 /im image (left) and the on-sky density diagrams (right) of the 2MASS point sources. The source and 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. 
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Figure 8. Spitzer GLIMPSE 8 /im image (left) and the on-sky density diagrams (right) of the 2MASS point sources, 
extinction ranges are indicated above each figure. The color bar and annotations are the same as Figure 2. 
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Figure 9. A comparison between the extinction range of all identified candidate clusters to the kinematic distance of their host SFC from 
RMIO (left) and with the model extinction from Marshall et al. (2006, right). The diagonal is indicated with the gray line on the right, 
and the local distance to extinction relationship (A^/^ = 0.1 mag kpc"-*^) is indicated with the dashed gray line on the left. This relation 
breaks down for distances greater than 6 kpc, as compared to the relation between the candidate and model extinction that remains tight 
for all candidate distances. The connected red ranges indicate the SFCs without a kinematic distance ambiguity resolution. The green 
range indicates the location of the Dragonfish Association properties from Rahman et al. (2011a). As no information about the kinematic 
distance was used in identifying the candidates, the correlation between the candidate and model extinction is not caused by a bias in the 
SSC method. 




Figure 10. A histogram of the ratios of the candidate cluster extinction to the modeled extinction from Marshall et al. (2006) in the 
filled histogram. The outlined histogram is the result of shuffling the candidate cluster extinctions 5000 times, normalized to the number 
of candidates in our sample. The larger scatter of the shuffled histogram provides additional support for the reality of the candidates. 
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Figure 11. NGC 3603 region from 2MASS in J (blue), H (green) and K (red) with the central cluster magnified (inset). The crosses 
indicate the locations of the only 2MASS point sources that pass the required quality cut within 33^^of the cluster center. The orange 
crosses are the point sources that meet the color cut of the cluster, while those in red are the sources that do not. 



the clusters in the inner galaxy. Therefore, the associa- 
tion can be identified using total stellar density without 
extinction cuts, as has been done by Knodlseder (2000). 

Using our method, we easily identify Cygnus 0B2 (Fig- 
ure 12) at an extinction range of 0.5 <Ak < 0.8, with 
position, structure and extinction consistent with the val- 
ues determined by Knodlseder (2000). The typical back- 
ground level in the field is 0.3-0.6 sources arcminute"^, 
approximately 10%-20% of the peak density of the asso- 
ciation. If we choose the boundary of the cluster to be 
the isodensity contour at 1.2 per arcminute"^, the aver- 
age rate of line-of-sight contaminating sources within the 
association boundaries is < 0.28, allowing for high-yield 
extraction of association members from the field. 

5.2. Statistical Significance 

In §5.1, we have discussed the success and sensitivity of 
the SSC method using previously identified clusters and 
associations. However, the problem that has plagued pre- 
vious cluster candidate identification methods has been 
the overwhelming number of false positives. We conduct 
a test to determine the rate of false positive candidate 
identifications through the SSC method. 

We generate a list of 40 artificial SFCs with posi- 
tions, dimensions and position angles randomly chosen 
to match the distribution of actual SFCs on the sky: the 
artificial locations are generated from a uniform random 
distribution between 270° < / < 90° and -1° < 6 < 1°. 
The position angles, semimajor axes and the axial ratios 
were generated by producing linearly interpolated cumu- 
lative distribution functions from the real SFC list. Using 
the SSC method, we attempt to identify "candidate clus- 
ters" for these artificial SFCs, with the same criteria as 
in §4.4. This test was conducted five times. From the five 
lists of artificial SFCs, the rate of identified candidates is 
3.0 ± 0.6 sources, which is a much smaller yield than the 
22 actual SFCs with identified candidates. It should be 
noted that the artificial candidates may be actual clusters 



or associations related to less luminous WMAP sources 
on the sky from Murray & Rahman (2010). 

In addition to the false positive test, the correlation 
between the modelled extinction and the candidate ex- 
tinction (§5; Figure 9) provides strong support for the 
reality of the clusters identified by the SSC method. Fi- 
nally, the spectroscopic confirmation of the Dragonfish 
Association (Rahman et al. 2011b) provides additional, 
circumstantial evidence of the clusters identified by this 
method. These independent lines of evidence indicate 
that most (~ 86%) of our cluster candidates are indeed 
associated with their host SFCs, and are their dominant 
engines of ionization and dynamical power. 

5.3. Testing Cluster Extraction 

We conduct a pair of tests to determine any bias in the 
extracted parameters of the candidate clusters. First, 
we test the accuracy of the number of extracted sources 
as a function of extraction radius. We select a 2MASS 
field centered on (/, b) = (343.11°, -0.50°) with the 
extinction-based color cut 0.6 < A'j^ < 0.7 and insert 
randomly generated clusters. The background level of 
sources in this field is 0.24 ± 0.11 arcminute"^. The 
clusters were generated with 40 visible stars in a 7' ra- 
dius, corresponding to a "true" extraction significance 
of '^3 for the entire radius of the cluster in this field 
and a central peak point source density of 0.8 sources 
arcminute"^. The worst case scenario for a poor extrac- 
tion yield is a shallow density profile; this leads to the 
highest number of stars at large radii that may be missed 
with the selection of a smaller extraction radius. To test 
this worst case scenario, we model the cluster with a uni- 
form stellar density in three dimensions. We insert 1000 
randomly generated clusters into different random loca- 
tions in the field. We extract each of the clusters with 
extraction radii ranging from 0.5' to 20^ 

In Figure 13, we present the results of this simulation. 
The 1-cr extraction boundary corresponds to an extrac- 
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Figure 12. The on-sky density diagram of 2MASS point sources between 0.5 <Ax < 0.^ 
indicates the point source density in sources arcminute"^ 



in the Cygnus OB2 region. The color bar 



tion radius that is ~0.8 times the size of the model clus- 
ter radius, and extracted cluster source numbers that 
are ^80% of the inserted numbers. Arbitrarily larger ex- 
traction radii produce extracted cluster source numbers 
that approach 100% of the inserted numbers, but with 
growing error at larger radii. When a more conservative 
2-cr or 3-cr extraction radius is chosen, the extracted star 
numbers are 50% and 30% of the inserted cluster num- 
bers, underestimating the total cluster population given 
this worst-case radial density model. This indicates that 
the chosen selection boundaries are either correctly esti- 
mating or underestimating the number of stars extracted. 

Next, we conduct a test of the visual cluster identifi- 
cation and extraction method to determine what biases 
exist in the selection of the candidate clusters. For this 
test, we generate artificial clusters with a random dis- 
tribution of positions, number of sources, and density 
profiles, place them in a real 2MASS field, and extract 
their positions, sizes, and number of sources with back- 
ground estimation in the same manner as with the actual 
candidate clusters. 

We generate 200 artificial clusters containing between 
29 and 67 sources. We fix the truncation radius of the 
clusters at 7^ the median of the semimajor axes from 
the candidate clusters. We vary the 2D-density profile 
of clusters using a Sersic probability distribution of the 
form: 

P{R) (X exp(-2(i?/i?T)^/''0 



We represent the number of artificial cluster sources in- 
serted as Ninputi and the number of field stars at that 
location as Ni^g. The number of sources estimated in 
the cluster after background subtraction using the visual 
extraction method is represented as Nguh- 

Of the 200 artificial clusters inserted, 160 were suc- 
cessfully identified and extracted. The centroiding of 
the identified clusters is quite accurate; the mean dif- 
ference between the inserted and extracted central posi- 
tions of the identified test clusters is 1.4±0.9 arcminutes 
and only 7 identified clusters have positional differences 
greater than 3^ The 40 unidentified test clusters are 
typically confused with Galactic structure and the size 
distribution is skewed towards clusters with smaller num- 
bers of points. In addition, the unidentified test clusters 
are skewed towards larger Sersic indices. This indicates 
that the visual selection method is biased against smaller 
clusters and those with peakier density profiles. This is 
consistent with the expectation that we are not sensitive 
to particularly compact clusters. 

From the 160 artificial clusters extracted, we can de- 
termine the accuracy of retrieving the initial cluster 
population. We find that 70% of clusters have a frac- 
tion of extracted sources to the inserted sources, fext = 
N sub /N inputs between 0.6 and 1.0 with a mean value of 
0.84 (Figure 14 inset). The typical measured ES values of 
the artificial candidates range from 2.5 to 5.5. We find no 
strong correlations between ng, N input or Ni^g with 



where Rt is the truncation radius, and is the Sersic 
index. Larger Sersic indices correspond to more centrally 
concentrated distributions, while smaller indices produce 
more shallow profiles. For simplicity in comparison, we 
choose discrete integer values between 1 and 4 for the 
Sersic indices. 

We choose the same 3° 2MASS field h = 343.11°, 
—0.50°), and extinction-based color cut (0.6 < < 
0.7) from the earlier test. This field is free of identified 
candidate clusters, but does contain the stratified Galac- 
tic structure feature similar to that shown in Figure 1. 



However, we find that there is an inverse correlation of 
the ratio of Ninput and Ni^g with f^xt (Figure 14); in the 
small number of cases where we overestimate the size 
of the cluster (by up to a factor of two), these clusters 
are a small fraction of the background level of the region 
{Ninput /J^bg < 0.35). Overall, these tests indicate that 
the method is robust in recovering the original cluster 
population and position, with a tendency to underesti- 
mate the total population size. 



6. MASS & LUMINOSITY ESTIMATES 
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Figure 13. The results of the model cluster extraction simulation. The mean extraction significance (solid line) and mean fraction of 
model stars extracted (dashed line) from all 1000 insertions are plotted in the top frame. The shaded area indicates the error in the fraction 
extracted based on the background variation. The mean point source surface density of all insertions is plotted in the bottom frame (solid 
line), with the background density level (0.24 arcminute"^; dashed line) and 1-a background variation (0.11 arcminute"^; shaded region) 
indicated. The extraction radius (R) is normalized to the radius of the inserted modeled cluster (Rmodel =7'). 



We estimate the physical properties of the candidate 
clusters to assess if the identified candidates can pro- 
duce the observed free-free fiux of their parent WMAP 
source (Table 2). We note that the stated boundaries 
from the candidate identification and extraction method 
are only visually identified based on a clear over density 
from the local background variation (§5). Consequently, 
the extracted clusters will likely exclude the diffuse out- 
skirts of the clusters or associations, especially when the 
background level of contaminating sources is particu- 
larly high. Such diffuse populations can produce signif- 
icant contributions to the total complex population, as 
has been recently observed in the Carina Nebula where 
previously undiscovered OB stars were identified in X- 
ray observations, doubling the massive stellar population 
(Povich et al. 2011). Although we found in the previous 
section that our method is likely to underestimate clus- 
ters' true population, we adopt a conservative approach 
and do not attempt to correct for the extraction effi- 
ciency. 

We estimate the least massive stars that are extracted 
in the candidate clusters by using the observed 2MASS 
Ks-hand magnitude limit from §4.1, with the mean ex- 
tinction range of the candidate and the kinematic dis- 
tance of the SFC from RMIO. In cases where the kine- 
matic distance is ambiguous, we use the distance most 



consistent with the Marshall extinction (see Figure 9). 
We use the absolute magnitude tabulations from Martins 
& Plez (2006) for 0-type stars and from Pickles (1998) 
for all other stars. We also use Martins et al. (2005) 
for the 0-type star mass determinations and Andersen 
(1991) for all other stars. From the mass of the least mas- 
sive star and the number of stars extracted in the candi- 
date cluster {Ntot—^BG in Table 1), we determine a total 
cluster mass (M*) and ionizing luminosity (Qo) integrat- 
ing over an IMF. To maintain consistency with previous 
work, we use the modified version of the Muench et al. 
(2002) IMF from Murray & Rahman (2010). The mass 
of the candidate was estimated by scaling the Muench 
IMF to the number of stars down to the least massive 
star extracted, and then integrating over the mass func- 
tion to determine the total mass. The ionizing luminosity 
was determined by using the mass-to-ionizing luminosity 
relationship from Murray & Rahman (2010). Note that 
we do not make any correction for the multiplicity of 
sources. 

Finally, we determine the expected free-free fiux contri- 
bution of the candidate at = 90 GHz. We also visually 
classify the candidates based on their location with re- 
spect to the identified SFC and the 8 fim morphology. 
We indicate that they are either "centrally-located" if 
they appear towards the center of a shell-like structure 
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Figure 14. The results of the cluster extraction test using artificial clusters in a Galactic 2MASS field centered on (/, b) = (343.11°, 
-0.5°). The ratio of the extracted point sources to the artificially inserted point sources, fext, vs. the ratio of the inserted point sources to 
the number of field sources in the background. We present the histogram of fext of test clusters in the inset. The visual extraction method 
generally estimates a cluster population of between 60% to 100% of the inserted cluster population. The method only estimates a larger 
number of points than inserted into the cluster when the cluster is small in comparison to the number of background sources. 



or the entire SFC, or "shell- located" if they appear on 
the edge of a shell or SFC. 

In Table 1, Ngub is the number of sources in the can- 
didate cluster once the background level is subtracted 
{Ntot — Nbg)' We include the Dragonfish Association 
properties from Rahman et al. (2011a). For the WMAP 
sources with multiple identified candidate clusters, we in- 
clude totals for the cluster masses, ionizing luminosities 
and free- free emission fluxes. 

We note that the latest stellar type of all extracted 
sources is between O- to A-type stars. All of these hot 
stars will have similar Rayleigh- Jeans colors in the NIK. 
Therefore, we do not expect cluster members to be ex- 
cluded based on having colors that fall outside of the 
color cut. 

The total cluster masses of the candidates range from 
150 to 10^ M©; perhaps not surprisingly, some appear to 
be low- mass objects unrelated to their host regions. In 
these cases, the estimated ionizing luminosity and free- 
free emission flux are overestimated as the IMF is poorly 
sampled on the upper end and the mass-to-ionizing lumi- 
nosity conversion breaks down. This has been discussed 
in detail by Kennicutt et al. (1989). Therefore the free- 
free flux contribution from these clusters will be much 
smaller than those tabulated since the mass-to-ionizing 



luminosity relation is unsuitable for a poorly sampled 
IMF. By ionizing luminosities of log Qo ^ 50 or clus- 
ter masses greater than 1600 M©, > 10 0-type stars are 
required to ionize the complex and the IMF should be 
sufficiently well sampled to allow the accurate use of the 
mass-to-ionizing luminosity relationship (Martins & Plez 
2006). 

We find the sum of the estimated free-free emission 
fluxes to be roughly consistent with the WMAP mea- 
surement for G34, G37, G298, G327, and G337. For G30 
and G49, the identified candidates appear to be centrally 
located, but the smaller-than-expected mass, ionizing lu- 
minosity, and free-free flux most likely come from a dif- 
fuse component of the cluster/association. Should these 
candidates be confirmed, an in-depth investigation, pos- 
sibly in the X-rays, will be required to identify the ex- 
tended or diffuse population. For G283 and G327, only 
shell-located regions are identified. In the case of G327, 
the lack of sharp shell structure in the bubble in 8 //m 
and 2MASS stellar overdensity structure (see Figure 7, 
top panel) may indicate that the original driving cluster 
of the bubble has evolved beyond its ionizing luminous 
stage, and the ionizing population of the complex is a 
more diffuse, triggered population. In source G291, we 
identify two possible triggered clusters; however the dom- 
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inant cluster in the region, NGC3603, has been previ- 
ously identified and was not detected by the SSC method 
(see §5.1.1). 

In some cases, such as G332, multiple candidates are 
identified at the same (or reasonably close) distances and 
extinction ranges. These candidates are likely associ- 
ated and may represent more widespread star formation 
across a molecular cloud or complex. Accordingly, the 
total embedded cluster mass in these situations would 
be the sum of the individual "clustered" regions, plus 
the diffuse component that we do not recover from the 
extraction process. 

The total expected free-free emission luminosities of 
the candidates in G24 and G332 substantially exceed 
the measured fiux from WMAP. There are at least six 
possibilities to account for this: (1) one or more of the 
candidates within the WMAP source are false positives, 
(2) the extraction has overestimated the number of mem- 
ber stars by a substantial fraction, (3) the kinematic dis- 
tance to the SFC is closer than the actual distance, (4) 
the cluster is older than 3.9 Myr, the age at which the 
most massive stars have left the main sequence (Bressan 
et al. 1993), (5) a higher- than- average fraction of the ion- 
izing photons are absorbed by dust, and (6) an unusually 
large fraction of the ionizing photons escape the region, 
or escape the Galaxy. To determine whether it is case 
(1) or (2), follow-up astrometric or spectral confirmation 
of the cluster is required. In case (3), the distances to 
the ionizing sources can be unambiguously determined 
though trigonometric parallax, if strong maser sources 
associated with the H II regions are available. In case 
(4), the evolutionary state of the candidate requires ac- 
curate determination of the spectral type of candidate 
sources to determine the earliest stars in the cluster and 
their individual evolutionary state. If this is the case, 
these clusters may harbor young core-collapse remnants 
of very massive stars, such as black holes or magnetars. 

With these caveats, we find candidates to central pow- 
ering clusters that can account for the observed WMAP 
free-free emission flux within G24, G30, G34, G37, G49, 
G298, G332, and G337. Of the original WMAP sources 
investigated in RMIO, only GIO does not have any candi- 
date clusters identified through this method, either cen- 
trally located or on the shell. 

7. COMPLETING THE UPPER END OF THE GALACTIC 
CLUSTER MASS FUNCTION 

Our knowledge of the upper end of the Galactic cluster 
mass function (CMF) has long been recognized as incom- 
plete. This has been established through comparison to 
galaxies with similar masses and star formation rates as 
the Milky Way (Larsen & Richtler 2000; Larsen 2009), 
and comparing the known cluster/association population 
with the inferred H II mass function (McKee & Williams 
1997; Hanson & Popescu 2008). These comparisons show 
that up to 90% of young, massive clusters and OB asso- 
ciations are yet to be found. 

Table 3 presents the current census of known massive, 
young (< 4 Myr) clusters and OB associations in the 
Galaxy. This census includes all known clusters and asso- 
ciations with masses similar to or greater than 10^ Mq, in 
addition to the massive candidates from this work. The 
age limit of 4 Myrs ensures that even the most massive 
stars, which produce the bulk of the ionizing luminosity. 



are still on (or near) the main sequence (Murray & Rah- 
man 2010). This effect is prominently seen in the case 
of Westerlund 1, where despite the large cluster mass 
(> 10^- M0) and rich population of Wolf-Rayet and 
late-0 type stars (Brandner et al. 2008), the cluster is 
not detected as a prominent free-free emission source in 
WMAP (Murray & Rahman 2010). If the candidates 
from this work are confirmed with the minimum masses 
estimated, the census of young, massive clusters and OB 
associations is doubled with 7 new candidates. 

Notably, the age restriction of the census excludes 
the red supergiant clusters recently discovered with ages 
> 10 Myrs and masses > 10"^ M© (Figer et al. 2006; 
Davies et al. 2007; Clark et al. 2009). The massive stel- 
lar component of these clusters has entirely evolved off 
the main sequence, and consequently the clusters have an 
insignificant ionizing luminosity. This is corroborated by 
the lack of any reprocessed free-free emission towards the 
clusters from the WMAP free-free emission maps (Mur- 
ray & Rahman 2010). 

7.1. Estimates of Completeness 

The completeness of the census can be estimated 
through CMFs from previous work for other galaxies in 
the visible wavebands, such as the Antennae and M83 
(Fan et al. 2009; Chandar et al. 2010). Ivanov et al. 
(2010) estimate that the Milky Way contains > 81 =b 21 
clusters with masses of 10^ — 10^ Mq and ages up to 
~25 Myr, assuming all clusters within 6 kpc are known 
(Piskunov et al. 2008). This estimate can be scaled to 
our 4 Myr age limit using the age distribution from Chan- 
dar et al. (2010, ^ oc r~^) with a minimum age of 
1.3 Myr based on the youngest clusters observed. The 
scaled estimate is 31 clusters in the Milky Way, of which 
Larsen (2009) would predict that 8 are bound. This is 
consistent with the estimate made by McKee & Williams 
(1997), who estimate the number of OB associations with 
M > 10"^ M0 (or Qo > 10^^-^ s-i) to be 29. 

Another independent measure of completeness is a 
comparison to the H II region luminosity functions of 
other galaxies. We compare to the H II region luminosity 
functions from Kennicutt et al. (1989), using a minimum 
Ha luminosity of L(Ha) > 10^^-^ erg s~^ which is sim- 
ilar to the 10^ M0 cutoff. In the most analogous cases 
to the Milky Way in terms of morphological class, NGC 
6384 (an Sb galaxy) contains 38 sufficiently luminous H II 
regions, while NGC 7741 (an SBc galaxy) contains 17. 
Smith & Kennicutt (1989) show that the H II luminos- 
ity function of the Milky Way is comparable to those of 
Sb-c galaxies, indicating that using the Sb-c galaxies as 
a guide is a valid estimation of the total young, massive 
clusters in the Galaxy. 

Both of these estimates of the total number of mas- 
sive (> 10^ M0) clusters/associations in the Galaxy are 
comparable to 15, the number of associations in our cen- 
sus, of which seven are presented here (Table 3). Since 
the estimates of the number of young, massive clusters 
expected is between 17-38, we infer from this that our 
knowledge of young, massive clusters in the Milky Way 
is now about half complete. 

7.2. Remaining Galactic Plane Targets 
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This work is based on the luminosity-selected sample 
of SFCs from RMIO on the basis that their ionizing lu- 
minosities are from young, massive clusters hosted by 
the SFCs. However, this work was not a comprehensive 
search through the entire Galactic plane. We can deter- 
mine the likely regions of young, massive clusters that re- 
main hidden by comparing our census to the catalog of gi- 
ant H II regions from Conti & Crowther (2004). As noted 
in RMIO, previously identified individual regions closely 
packed together are likely associated in larger complexes 
(see §2.1 in RMIO for a detailed discussion). For the pur- 
pose of this comparison, we group the giant H II regions 
from Conti & Crowther (2004) into 27 degree-sized zones 
in the Galactic plane. Each Galactic plane zone consists 
of 1-7 giant H II regions. 

Of the 27 Galactic plane zones, young massive (> 10^ 
Mq) clusters or associations have not been identified in 
18. Within the 18 zones without young massive clusters, 
6 have previously known powering clusters with masses 
less than 10^ M©, and 5 have candidates identified in this 
work also less than 10^ Mq. We list the zones lacking a 
young, massive cluster in Table 4, tabulating the Galactic 
Plane Region (in Galactic Longitude), their kinematic 
distance(s), and their names where applicable. For the 5 
zones with candidates identified in this work, they have 
lower mass limits > 10^*^ M©. Because our search is not 
sensitive to the diffuse outskirts of clusters/associations, 
it is possible that the candidates in the 5 zones have 
masses larger than 10^ M©. Further observations and 
analysis are required to better constrain the total masses 
of these clusters. 

The advent of new Galactic plane NIR surveys, such 
as the UKIRT Infrared Deep Sky Survey (UKIDSS; 
Lawrence et al. 2007) and the Vista Variables in the Via 
Lactea Survey (VVV; Minniti et al. 2010), will enable 
more sensitive probes for clustering than has been pos- 
sible with 2MASS. Both UKIDSS and VVV are signif- 
icantly deeper than 2MASS (~4 magnitudes in the K- 
band), with a much smaller point source function (~4 
times smaller) . Consequently, applying the SSC method 
in these surveys would enable the detection of more 
highly extinguished and less massive clusters, since the 
greater sensitivity allows for more highly reddened and 
intrinsically less luminous stars to be detected. In ad- 
dition, these surveys would better probe more compact 
clusters, where the 2MASS point source function size has 
rendered them confused. However, two challenges will 
arise in using deeper surveys; because they are more sen- 
sitive to the less luminous members of a cluster, the as- 
sumption of Ray leigh- Jeans NIR colors will break down 
and a more complex search strategy following the ex- 
pected color of the extinguished main-sequence will be 
required. Secondly, the increase in sensitivity will also 
increase the number of field sources; to better separate 
field and cluster sources, the explicit use of the luminos- 
ity information will be required. However, the potential 
in these new surveys is not limited to finding the remain- 
ing Galactic massive cluster and association population, 
but also better characterizing the candidate clusters and 
associations detected in this work. 

We note that this work searches for candidate clus- 
ters with extinctions of Ak < 2.0, while the extinction 
model from §5 indicates that significant portions of the 
Galactic plane will have extinctions of greater than that. 



This includes a number of candidates from this work if 
you accept the far kinematic distance from the SFC. The 
currently used 2MASS data is not sufficiently populated 
with highly-reddened stars to allow the SSC work due at 
high extinctions. However, the deeper NIR surveys will 
allow searches for more highly extinguished clusters and 
associations. 

Twelve of the 27 Galactic plane zones have not been 
probed in this work, leaving a substantial discovery space 
for the SSC method to be applied. Recently, Lee et al. 
(2012) have identified SFCs in the remaining WMAP 
free-free sources not originally covered in RMIO, which 
provides a new sample of morphologically-identified re- 
gions to probe. Investigating this remaining SFCs may 
enable the completion of the entire upper end of the 
Galactic CMF. 

8. CONCLUSIONS 

We have used the 2MASS point-source catalog to 
search for massive clusters and association candidates 
within the SFCs from WMAP, looking for the bright- 
est stars, taking into account the line of sight extinction. 
Including the previous work of Rahman et al. (2011a), 
we find that 22 of the 40 SFCs host a candidate clus- 
ter that has been identified through our method, either 
central to the SFC or on its periphery. The extracted 
candidates have semimajor axes of 3' - 26'. The average 
extraction significance oi the candidates is 2.4. The can- 
didates with smaller extraction significances are selected 
based primarily on morphology. 

We discuss the case of two previously identified clus- 
ters/associations: NGC 3603, which is unidentified 
through our method due to the compact nature of the 
cluster, and Cygnus 0B2, which we readily identify and 
whose properties we accurately constrain. We find the 
false positive rate to be 3.0 ±0.6 out of 22 identified. We 
test the visual identification method and find it to be 
robust in determining the population of the cluster. The 
strong correlation between the candidate extinctions and 
the modelled extinction based on the SFC distances pro- 
vides strong support for the reality of the clusters. This is 
reinforced by the spectroscopic confirmation of the Drag- 
onfish Association, originally identified the same method 
of Rahman et al. (2011b). 

We estimate the masses and luminosities of the can- 
didates, showing that the candidates can account for 
the observed WMAP free-free fiux for most of the in- 
vestigated sources. Of the WMAP sources investigated, 
only one (GIO) lacks any candidate clusters or associa- 
tions. Finally, we provide a Galactic census of the young, 
massive clusters and OB associations. With the newly 
identified candidates from this work, we estimate that 
our knowledge of the young, massive cluster population 
is about half complete. We indicate locations on the 
Galactic plane possibly hosting the remaining unidenti- 
fied massive clusters and associations. 

Future astrometric, spectroscopic and/or X-ray follow- 
up observations can confirm the candidates, measure 
their physical properties and determine their environ- 
mental impact. With such confirmation, the candidate 
clusters and associations within the most luminous SFCs 
in the Galaxy will enable the more accurate determina- 
tion of the upper end of the Galactic CMF. 
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^ Candidate cluster properties from Rahman et al. (2011a) 

^ Where a kinematic distance ambiguity exists, we list the distance most consistent with candidate extinction first, and the less consistent 
distance in parentheses. 

^ Model extinction from Marshall et al. (2006) 
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Table 3 

Census of Massive (> 10"* M©), Young (< 4 Myr) Clusters & OB Associations in the 

Galaxy 
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>4.0 


2.8 


RCW49, Westerlund2 


(9) 


G287 


>4.3 


2.3 


Carina 


(7) 


G291 


4.2 


6.0 


NGC 3603 


(4) 


G298 


5.0 


9.7 


Dragonfish 


(6) 


G332 


>4.1* 


3.2 


SFC 34a,b 


(1) 


G337 


>4.8" 


10.9 


SFC 37 


(1) 



References. — (1) This work; (2) Conti & Blum (2002); (3) Figer et al. (1999); (4) Harayama 
et al. (2008); (5) Paumard et al. (2006); (6) Rahman et al. (2011a); (7) Smith & Brooks (2007); 
(8) Wright et al. (2010); (9) Ascenso et al. (2007); 



Note. — Wolff et al. (2007) use an IMF-sensitive extrapolation over a small mass range to 
determine the masses of IC 1805 and NGC 6611. The extrapolated mass of NGC 6611 is 10'^ '* 
Mq, in contrast to more recent work by Bonatto et al. (2006) that revise the minimum cluster 
mass down to 10^'^ Mq. In both of these cases, the free-free luminosities of the regions from 
Murray & Rahman (2010) are inconsistent with young clusters with M> lO'* Mq, indicating 
smaller masses. Consequently, we exclude both of these regions from this census. 

Candidate Cluster/ Association from this work. 
^ Unresolved kinematic distance ambiguity. The distance indicated is the most consistent with 
the model extinction to the candidate. 
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Table 4 

Giant H II Regions Lacking Known Young Massive Clusters 



Galactic Region Distance Gomponents (kpc) Names 



G3 


14.3, 14.6 


AMWW34, AMWW35 


G6* 


2.8 


M8 


G8 


13.5 




GIO* 


4.5, 15 


W31 


G15* 


2.4 


M17 


G20 


11.8 




G30# 


6.3 


W43, SFG 17 


G49# 


5.7 


W51, SFG 25 


G70 


8.6 


W58A 


G133* 


4.2 


W3 


G274 


6.4 


RGW 42 


G289 


7.9 




G305* 


3.5 




G320 


11.5, 12.6 




G327# 


3.6 


RGW97, SFG 33 


G347* 


7.9 


[DBS2003] 179 


G351 


13.7 





Note. — All regions and distance components are from Conti & Crowther 
(2004). 

Regions with known powering clusters, but significantly less massive than 10^ 

Mo 

^ Regions with candidate powering clusters, but with minimum stellar masses 
less than 10^ Mq 



